15-712 Project Proposal

نویسندگان

  • Anthony Gitter
  • Alex Grubb
  • Jeffrey Barnes
چکیده

In virtually all scientific disciplines, technological advances over the past years and decades have resulted in an exponential growth in the quantity of raw scientific data available for analysis. Massive datasets, ranging from hundreds of MBs to many TBs, are increasingly common and available to the public. Spanning the breadth of scientific fields, this publicly accessible information includes NASA astrophysics and astronomy collections [1], partial three-dimensional mappings of the universe [2], earthquake and seismic data [3], complete genomes for a multitude of organisms [4], and so on. However, collecting and publishing raw data is only the beginning of scientific understanding. Unfortunately, the enormous benefits to be gained by thoroughly analyzing these datasets is typically matched by the complexity of such analysis. For instance, sequence alignment is a powerful technique for predicting the function of newly discovered genes [5], but computational limitations prevent the best algorithms available from being used on full datasets. Supercomputers and customized high-performance solutions such as the CLC Bioinformatics Cube [6] can sometimes employed, but the cost of such solutions is prohibitive. Furthermore, for some classes of scientific problems involving huge datasets (including those mentioned above), processing power can be largely wasted even on a supercomputer if the computation is disk-bound. With the advent of multi-core personal computers, clusters of commodity machines have the potential to become a reasonable alternative platform for intense scientific computing. Using clusters of readily available machines in place of supercomputers or problem-specific high-performance machines could not only reduce costs, power consumption, and possibly execution time, but it would also open the multitude of rich, massive datasets to a wider community of researchers, conceivably accelerating the rate of scientific discovery.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Remote sensing urban heat-island phenomenon in four Texas cities: San Antonio, Houston, Dallas-Fort Worth, and El Paso

A project proposal is the first step to us to get funding from anywhere. You need to write a proposal to your advisor, your university, your company, your city, state, governmental agencies (USGS, USDA, NSF, NASA, NOAA, DoEd, DOE, ....) for any funding opportunity. A proposal for a small amount of fund is usually 5 pages; for a standard and multi-year proposal to governmental agencies, it is us...

متن کامل

چشمه نور ایران، اولین آزمایشگاه ملی برای تحقیقات بین رشته‌ای

The Iranian Light Source Facility (ILSF) project is the first large scale accelerator facility which is currently under planning in Iran. On the basis of the present design, circumference of the 3 GeV storage ring is 528 m. Beam current and natural beam emittance are 400 mA and 0.477 nm.rad, respectively. Some prototype accelerator components such as high power solid state radio frequency ampli...

متن کامل

Contemporary methods for evaluating complex project proposals

The ability to evaluate project proposals, assessing future success, and organizational value is critical to overall business performance for most enterprises. Yet, predicting project success is difficult and often unreliable. A four-year field study shows that the effectiveness of available methods for evaluating and selecting large, complex project depends on the specific project type, org...

متن کامل

15-712 Systems Final Report Methods for Recognizing Service Quiescence

Our motivation for this project is evaluating our hypothesis that if we had a variety of statistics concerning process resource consumption, we would be able to determine whether and when a process is quiescent.1 We instrument the Linux operating system to allow us to gather relevant statistics, and write a second program, which we call QAnalyzer, that allows us to analyze these statistics, loo...

متن کامل

15-740 Project Milestone Report

The ideal choices for the tasks presented in the project proposal would be the NVIDIA CUDA toolkit (http://developer.nvidia.com/object/cuda. html), which exposes more underlying architecture to programmers. However, the package requires a capable NVIDIA video card, and we could not get for this project. ATI also designed a similar platform “Close-to-Metal (CTM) Device” (http://ati.de/companyinf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007